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O ■ ABSTRACT 

<N ■ 

The recent publications of the DENIS Catalogue towards the Magellanic Clouds (MCs) with more than 1.3 million 
sources identified in at least two of the three DENIS filters (I J Kg) and of the incremental releases of the 2MASS 
point source catalogues (J H Kg) covering the same region of the sky, provide an unprecedented wealth of data 
related to stellar populations in the MCs. In order to build a reference catalogue of stars towards the Magellanic 
Clouds, we have performed a cross-identification of these two catalogues. This implied developing new tools for cross- 
identification and data mining. This study is partly supported by the Astrovirtel program that aims at improving 
access to astronomical archives as virtual telescopes. The main goal of the present study is to validate new cross- 
matching procedures for very large catalogues, and to derive results concerning the astrometric and photometric 
accuracy of these catalogues. The cross-matching of large surveys is an essential tool to improve our understanding 
of their specific contents. This approach can be considered as a new step towards a Virtual Observatory. 
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The Magellanic Clouds (MCs) have been recently fully observed by two major infrared surveys : the Deep Near 
Infrared Survey of the Southern Sky - DENIS0 and the Two Micron All Sky Survey - 2MASS.I A Near Infrared Point 
Source Catalogue towards the Magellanic Clouds, based on DENIS data, has been published (hereafter DCMCB). 
The catalogue covers an area of 19.87 x 16 degrees centered on the Large Magellanic Cloud (LMC), and an area of 
14.7 x 10 degrees for the Small Magellanic Cloud (SMC). To compute this catalogue, the objects were required to 
be detected in at least two of the three DENIS bands l(Gunn - i,0.79/j,m), J(1.22/zm), K s (2.15^m). The 2MASS 
observed the whole Magellanic Clouds in three photometric bands : ,J(1.23/j,to), H(1.63^im) and Ks(2.15/xm). Most 
of the data are available from the Second Incremental Release PSCp except for small gaps in regions crossing the 
LMC and SMC bars and around bright stars (Fig. 111). 

The Magellanic Clouds are one of the best places to study stellar evolution because of their proximity and common 
distance of their constituent objects. Near infrared surveys provide interesting data for this kind of study because 
of their insensitivity to dust reddening. The number of sources from both surveys are recorded in Table [j]. Because 
of different sensitivity limits, DENIS sources detected only in the I and J bands are often detected in H and Ks 
by 2MASS. The 2MASS observations reach almost one magnitude fainter than DENIS in the Kg channel, while 
they are roughly equivalent in the J channel (Fig. ||). So it would be interesting to cross-match the two catalogues 
to complete the spectral range of the DCMC IJ-sources with the H and Kg bands coming from 2MASS. Thus, 
cross-identification of the DCMC and 2MASS catalogues will provide an unprecedented basis for study of stellar 
populations in the Magellanic Clouds and for further cross-identifications with catalogues at other wavelengths. 
Furthermore the Clouds are a good place to develop and test cross-matching procedures for dense and large regions 
of the sky. 

Further author information: (Send correspondence to N.D.) 
N.D.: E-mail: delmotte@astro.u-strasbg.fr 
D.E.: E-mail: egret@astro.u-strasbg.fr 
C.L.: E-mail: loup@iap.fr 
M.R.C.: E-mail: mrcioni@strw.leidenuniv.nl 
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Figure 2. Completeness diagrams for the LMC. 2MASS observations are deeper than DENIS ones in the Kg band. 

2. DATA ORGANIZATION 

The present work is based on public data from 2MASS, as given in Sect, [jj DCMC data have been obtained from a 
local copy of the catalogue that includes the missing strips of the first release (the second release is currently under 
process). Before running the cross-matching programs, we organized the raw data, splitting both catalogues into 
smaller pieces. The DENIS observational strategy has been to divide the sky in strips of 30° in Declination (DEC) 
and 12' in Right Ascension (RA). To define subsamples, we adopted a strip by strip strategy because : 

• Our cross-matching algorithm is well adapted to data files with small extension in RA (alpha) and a strip is 
only 12' large in RA. 

• The cross-matching criteria depend on the strip number as explained below (see Sect. ^). 

First we split the DCMC catalogue by strip number. There are 119 strip-files for the LMC and 88 strip-files for the 
SMC. Then for each strip-file, we extracted from the 2MASS data all the point sources overlapping the same region 
of the sky. 

The cross-matching program is run for each strip. Each time we have two input files, one from DCMC, one from 
2MASS, corresponding to a given strip number. Both files have been previously sorted by ascending declination, in 
order to optimize the cross-comparison procedure. The procedure can be described as follows : both files are read 
sequentially in parallel ; for each record of the first file (say, DCMC) we search for all possible cross-matches in 
the second file (here 2MASS). For that, we read the second file and keep in memory a buffer of possible candidates, 
making sure that the highest value of the declination in file 2 is actually higher than 5 + 5s of the current record 
of file 1, and the same thing for the lowest value of the buffer of file 2, which has to be lower than 5 - 5s- Possible 
cross-matches are kept, together with the corresponding differences in positions and magnitudes. In a first run, we 
keep the smallest difference in position as the most probable, using a box of S a , 5s = 10". 

3. FIRST CROSS MATCHING STEP : FINDING DISCREPANCIES IN THE 

ORIGINAL CATALOGUES 

The easiest way to find matches between two catalogues is to fix a searching box in position of a few arcseconds and 
compare the coordinates. It will work really well in most cases because the astrometry of the two DCMC and 2MASS 
catalogues is accurate enough (better than one arcsecond). Furthermore both catalogues were calibrated upon the 
USNO-A2.0 catalogued Consequently the distance match is better than 0.5" for the great majority of stars. There 
is in principle no risk of confusion at such a small scale. While this is true in general, in practice the cross-matching 
exercice has proven to be a powerful tool to detect subsets of the data files which deviate from the perfect situation, 
and primarily areas suffering from problems in the astrometric or photometric calibration. Here is what we did to 
find out these regions : 
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Figure 3. a and 5 are in degrees. They span the central part of the SMC. Each dot corresponds to one cross- 
identification with a distance larger than 1". The geometric patterns indicate that subsets (images and strip overlaps) 
of the DCMC present errors in their astrometric calibration. 



• We split each catalogue into big chunks : 5° width in RA for the LMC and 10° width in RA for the SMC. 

• We ran a cross-matching program based on distances only. That is for one DCMC source, we searched in 
2MASS for all the possible matching sources in a radius of 10". 

• Between all the possible associations found, we kept only the association with the closest distance. 

• Then we made a map in [a, S] of the distances to the closest neighbours : 

— Each cross-identification is marked with a dot in the [a, S] plane. 

— The color of each dot depends on the distance of the cross-identification. 

Fig. H shows the distance map for the central part of the SMC. We plotted only the cross-identifications with distances 
larger than 1". They cannot be taken for random associations because geometrical and well-defined patterns appear 
on the map. These patterns reveal problems in the astrometry for a few DCMC images and along the border of 
several strips. We found two main reasons to explain these results : systematic distance shifts associated with 
redundant DCMC sources and non-systematic effects dealing with field distortions. 

3.1. Redundant Sources 

Redundant sources are located on the overlaps with adjacent strips, and with adjacent images of the same strip. Fig. ^ 
shows the Aladin view of a LMC region containing redundant DCMC sources. AladinLj is an interactive software 
sky atlas developed by CDS, allowing one to visualize digitized images of any part of the sky and to superimpose 
entries from astronomical catalogues. Redundant DCMC sources are systematically shifted by 5" in declination above 
2MASS sources. This problem mainly occurs in crowded regions where the number of USNO-A2.0 reference stars is 
small because of the confusion. It can happen that an astrometric reference star was incorrectly cross-identified with 
a DENIS source, leading to a systematic shift in RA or/and in DEC. It usually affects only one image, sometimes a 
few adjacent images. The DENIS sources located in the overlaps with the adjacent images of the same and adjacent 
strips were thus not properly cross-identified with the DENIS sources of those adjacent images. 

To find out the consequences of these redundant sources on the cross-matching, we took an area of 3 x 2.7 degrees 
in the LMC with -71° < 5 < -68° and 75° < a < 82.5°, and including redundant sources. 

First we made a histogram of the distances to the closest neighbours (Fig. I| (a)). The physical associations are 
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Figure 4. Aladin view (MAMA scan of an ESO plate) of a LMC region containing redundant DCMC sources 
shifted by 5" in declination (crosses). 2MASS entries are plotted as squares. These redundant sources are always 
located on the overlaps between adjacent images and strips. 

located on the left part of the histogram (distance < 2"), whereas the non-physical random associations are on the 
right part. This general feature is complemented here by a rather striking effect : a bump is clearly visible around 5". 
Then to understand better what was going on, we considered a and 5 separately (Fig. |^ (b)). Each point represents 
one association between DCMC and 2MASS. On the x-axis, we have : 

("2MASS ~ "DCMC) x C0S(5 DCMC 

and on the y-axis : 

5 2MASS ~ 5 DCMC" 

If there is no significant shift between DCMC and 2MASS, all points should be centered around (0,0), which is 
the case for the great majority of stars. But we can see another cluster of points around (0,-5) which corresponds 
to the relative shift of the redundant sources (5" in <5, 0" in a). To characterize more precisely the faulty images, 
we took all the cross-identifications with distances between 4.5" and 5.5" and we plotted them in [a, 5] (Fig. || (c)). 
They appear to be well located inside a square region of the sky : the images 70 and 71 of the strip 4969. All images 
affected by redundant sources were discarded for the following of the procedure. 12 images are concerned in the 
LMC (0.1%) and 42 in the SMC (0.8%). 

3.2. Field Distortions 

Field distortions in the DCMC affect the quality of the astrometry. To detect them, we proceeded strip by strip as 
follows : 



• We kept only well confirmed DCMC sources : 10.5 < I < 16.5 and flags in the I band equal to zero. 
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Figure 5. (a) (Top left) Histogram of distances in arcseconds between DCMC and 2MASS matched sources, 
(b) (Top right) Each cross-matched source is marked with a dot. 6a and 66 are in arcseconds. (c) (Bottom) a and 
6 are in degrees. Each point corresponds to one cross-identification with 4.5" < distance < 5.5". 



• We ran a cross-matching program based only on distances, with a searching box that goes up to 30". 

• Between all the possible associations found, we kept only the association with IJqcmc — ^2MASSl ^-5. ^ ne 
selection is done on magnitude because in case of field distortions small distances are not a reliable enough 
criterion. 

As an example, the results of the cross-identification for strip 6938 are summarized in Fig. ||. The relative shifts 
6a and 66 are a function of the pixel coordinates (x,y) of the camera. The maximum shift in a between DCMC and 
2MASS goes up to 3.5". We found 11 and 14 strips affected by field distortions at a level larger than 2" in the LMC 
and SMC, respectively. One of them (strip 5830 in the SMC) had to be rejected because of erroneous astrometric 
calibration. 

3.3. Photometry 

We also searched for a systematic shift in magnitude between the DCMC and 2MASS for the J and Ks bands. We 



used the cross-identifications coming from Sect. 3.2. Mean shifts in J and Ks have been computed for each strip. 
The results presented on Fig. are for the strip 6938 (slot 4034). The magnitude shift is -0.07 for the J band and 
-0.24 for the K s band. 

4. CROSS MATCHING CRITERIA 

The astrometric and magnitude shifts depend on the strip and have to be taken into account. Strategies for coping 
with them have been implemented, to allow a proper strip by strip cross-matching of both catalogues. For each 
DCMC source of a given strip, we search the best 2MASS association using both position and magnitude criteria : 
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Figure 6. Typical example of field distortion, especially along the x-axis of the camera. The results presented here 
are for the strip 6938 located in the SMC. 




Figure 7. Magnitude histograms for cross-matched sources of strip 6938. (Left) Jdcmc — J2MASS histogram. 
(Right) Ks DCMC -Ks 2MASg histogram. 



Selection on coordinates : The shifts in a and 5 vary inside the images of the strip but distortions do not 
vary significantly along the strip. They are approximately the same for all the images of the strip. So it is 
better to use the statistics of the whole strip instead of one single image. Thus we can define a specific searching 
box for the strip. The size of the box will take into account the shifts in a and 5 found for this strip number 
as explained in Sect. 3.2. The default size of the searching box when there are no shifts is 3". So we have now 
an enlarged and assymetric searching box : 



^ a min - 3 " <5a m ax + 3" 

cos ,5 < "DCMC ~ "2MASS < COS(5 

SS min < S BCMC ~ ^2MASS < ss max > 

where <5a: mm , <5a m ax> <^niin' ^max axe the minimum and maximum shifts in right ascension and declination. 

2. Selection on magnitudes : Between all the possible associations found in step 1, we must keep the best one. 
We have seen that keeping the association with the smallest distance is no more a reliable criteria because of 
field distortions. So we have to check the compatibility in magnitude for each association, after applying on 



the strip data the associated mean magnitude shifts < 5J > and < <5Kg > computed in Sect. 3.3 



• If Ks is not detected in one or both catalogue, the selection is done on J. The following relation has to be 
true to keep the association : 



\53- < 53 > I < w x Jai + a\ , 

V J DCMC J 2MASS 

where w = 2 is a weight, and en and ctt are the relative photometric uncertainties as quoted 

° J DCMC J 2MASS 

in both catalogues. Relative uncertainties are in general very small for bright stars, less than 0.01 mag. 
However, uncertainties on the absolute calibration are much larger : about 0.1 mag for the DCMC. If we 
apply abruptly the above criterion, we will lose many cross-identifications for the stars with small relative 
uncertainties. We thus need to refine the selection criterion and consider two cases : 



if wx./a 2 , +a 2 1 < A3 then \S3- < S3 > I < A J , else 

J DCMC J 2MASS 



if w x , /ctt + (It > AJ then \SJ- < S3 > I < w x . <j\ + a\ , 

V J DCMC J 2MASS V J DCMC J 2MASS 

where AJ = 0.45 is the maximum width of the <5J distribution. 

• If J is not detected in one or both catalogue, the selection is done on Ks as above but this time we have 
AK = 0.60. 

• If J and K s are detected in both catalogues, the selection is done on J and then on J-K s . 

• If J and Ks are not detected in one or both catalogue, the association is lost. 

3. Applying these criteria, if there are still more than one possible association for one DCMC source, then we 
keep the association with the smallest S3 or <5K S . 



5. RESULTS 

Running the cross-matching programs took about two hours for each Cloud on a Unix station. For each strip, we 
computed the percentage of DCMC sources matched with 2MASS. Results are summarized in Table 2. Nearly 80% 
of the LMC strips and 70% of the SMC strips have a match rate better than 90%. The 16 LMC and 18 SMC strips 
with a match rate smaller than 80% correspond to the gaps in the 2MASS data. The merged point sources have at 
least two of the four photometric bands and J or Ks is always present because it was the magnitude link between 
the two catalogues. Table 3 lists the number of merged sources as a function of detected wavebands. When J or Ks 
is present, it comes either from the DCMC or 2MASS, or from both. 

86% and 83% of the DENIS point sources are matched with 2MASS for the LMC and SMC, respectively. The 
number of stars in common is 1252700 in the LMC and 278856 in the SMC. 



Table 2. Number of strips per match rate. 



LMC 


SMC 


Match rate 


Number of strips 


Match rate 


Number of strips 


>95% 


42 


> 95% 


30 


[90%, 95%] 


52 


[90%, 95%] 


30 


[80%, 90%] 


9 


[80%, 90%] 


10 


< 80% 


16 


< 80% 


18 


Total 


119 


Total 88 



Table 3. Merged DENIS and 2MASS point sources. 



LMC 


SMC 


IJ 


3 


IJ 





IK S 





IKs 





JK S 


10 


JK S 


1 


IJKs 


58 


IJKs 


16 


IJH 





IJH 





IHK S 





IHK S 





JHKs 


699 


JHK S 


185 


IJHKg 


1251930 


IJHKs 


278654 


Total 


1252700 


Total 


278856 



We tried to find a mean relation between DCMC and 2MASS magnitudes, restricting to the range [10,14] in J and 
[8,12] in K s , avoiding saturated bright stars as well as the faintest ones. There is a systematic shift of the absolute 
calibration between the two catalogues (Fig. ^). For each strip, we calculated the median of 53 and SK S . Fig. [l(] 
shows the histograms of all the shifts found. The mean systematic shift between the two catalogues is -0.10 in J and 
-0.14 in K s . 

New color-magnitude diagrams (CMDs) produced out of the merged catalogue are presented as in Fig 11. 

6. CONCLUSION 

The work presented hera-is an intermediary step before the production of a Master Catalogue of stars towards the 
Magellanic Clouds (MC2EI) which is to appear at the end of 2001. The Master Catalogue should also include cross- 
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Figure 8. Completeness diagrams for the merged DENIS and 2MASS point sources. 
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Figure 10. Histograms of the magnitude shifts found for the 119 LMC strips and 87 SMC strips. 
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Figure 11. CMDs for both Clouds (original in colors). There are 1251988 sources for the LMC and 278670 
for the SMC. We used the I band coming from the DCMC and the K band coming from 2MASS. Note that those 
observations were not simultaneous so these CMDs should be considered as indicative diagrams. 

identifications with catalogues and tables at other wavelengths : GSC-II (optical), MSX and IRAS (far infrared). 
This reference catalogue will be made available as a support for a number of studies concerning, e.g. the stellar 
populations in the Magellanic Clouds, the structure of the Clouds, or certain classes of objects (X]epheids, AGB stars, 
etc.). Recent articles, such as those by Nikolaev & Weinberg (2000)a and Cioni et al. (2000)Ej have demonstrated 
the power of near infared surveys to improve our understanding of those neighbouring galaxies. 
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